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Abstract 

We consider scalar conservation laws with convex flux and random ini- 
tial data. The Hopf-Lax formula induces a deterministic evolution of the 
law of the initial data. In a recent article, we derived a kinetic theory and 
Lax equations to describe the evolution of the law under the assumption 
that the initial data is a spectrally negative Markov process. Here we show 
that: (i) the Lax equations are Hamiltonian and describe a principle of 
least action on the Markov group that is in analogy with geodesic flow on 
SO{N); (ii) the Lax equations are completely integrable and linearized via 
a loop-group factorization of operators; (iii) the associated zero-curvature 
equations can be solved via inverse scattering. Our results are rigorous for 
Af-dimensional approximations of the Lax equations, and yield formulas 
for the limit A'^ — )■ oo. The main observation is that the Lax equations 
are a A'^ — >■ oo limit of a Markovian variant of the A'^-wave model. This 
allows us to introduce a variety of methods from the theory of integrable 
systems. 

MSG classification: 35R60, 37K10, 60J35, 60H99, 82C99, 35L67 
Keywords: Shock clustering, stochastic coalescence, kinetic theory, integrable 
systems. Burgers turbulence, N-wave model. 



1 Introduction 



1.1 Turbulence and flows of probability measures 

A fundamental problem in the statistical theory of turbulence is to construct 
random incompressible velocity fields that model isotropic homogeneous turbu- 
lence. Such random fields must be supported on weak solutions to the Euler 
equations that dissipate kinetic energy in accordance with the criterion of Kol- 
mogorov and Onsager [22, 25, 35, 41]. This problem is currently out of reach. 
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Much of our understanding is based instead on vastly simplified models. One 
such model, proposed by Burgers, is to understand the statistics of the Cole- 
Hopf (or entropy) solution to Burgers equation 



dtu + dx 




0, a;eM,t>0, 



(1.1) 



with random initial data Uq such as white noise [18, 43]. A closely related prob- 
lem is to consider (1.1) with random forcing, and to understand the associated 
equilibrium measure. While we focus on the unforced equation in this article, 
some of our methods also apply to (1.1) with random forcing [38]. This class 
of problems is called Burgers-KPZ turbulence. An explanation of its role in 
statistical hydrodynamics may be found in [23]. 

Random initial data also arise in applications unrelated to turbulence. For 
example, the coarsening of domains in the kinetics of phase transitions is mod- 
eled by solutions to the Allen-Cahn and Cahn-Hilliard equations that emerge 
from disorder [37]. In such problems, the equations of continuum physics in- 
duce an evolution of the law of the initial data. The first clear formiilation of 
an evolution equation for the law of solutions seems to be due to Hopf [30]. In 
this article, we show that this problem is suprisingly rich, even in the setting of 
perhaps the simplest nonlinear equations. 

We consider the scalar conservation law 



with a strictly convex, flux /. The Hopf-Lax formula defines a unique entropy 
solution to (1.2). When the initial data uq is random, the Hopf-Lax formula 
induces a deterministic evolution of the law of uq. Our main contribution is 
to show that if uq is a Markov process in x with only downward jumps or a 
limit of such processes, then the evolution of its law is completely integrable. 
Our results include Burgers model, but the assumption f{u) = tt^/2 is not 
necessary. Neither do we require a special choice of initial condition such as 
white noise or Brownian motion (though these yield important exact solutions). 
For all convex, f and a broad class of random initial data the evolution 
of the law of u{-,t) is given by kinetic equations for shock clustering that are 
a continuum limit of a Markovian variant of the N-wa.ve model. The iV-wave 
model generalizes the three-wave model of Manakov and Zakharov in nonlinear 
optics [47] and is well-known to be completely integrable [3, p. 55], [48, III. 4]. 
We show that this model also underlies Burgers turbulence and shock clustering 
for (1.2) with arbitrary convex /. We stress that it is the evolution of the law 
oiu{-,t) that is integrable, and this is completely distinct from the integrability 



This is surprising enough, but more is true. Our kinetic equations sit at a 
rich juncture of problems: (i) geodesic flows on Lie groups [36]; (ii) the inte- 
grable systems of the 19th century, as recast by Moser [39]; (iii) the completely 
integrable systems of random matrix theory discovered by the Kyoto school [31]; 



dtu + dxf{u)=0, xGR,t>0, u{x,0)=uo{x) 



(1.2) 



of (1.1). 
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(iv) intcgrablc hierarchies on groups [45]; and (v) asymptotic problems in rep- 
resentation theory [33]. This reveals a close and unexpected relation between 
the theory of Markov processes, kinetic theory and integrable systems. 

This confluence of ideas is qiiite bewildering, and a full explanation for their 
role in what should be a purely probabilistic problem still eludes us. Some 
heuristic explanation is perhaps the following: At its heart, complete integra- 
bility is an explicit imderstanding of the hidden symmetries of a Hamiltonian 
system. On the other hand, understanding continuous deformations of the law 
of a stochastic process is a basic problem in probability theory (e.g., the Gir- 
sanov theorem may be viewed in this light). The deeper principles here seem 
to be that: (i) the space of Feller processes with bounded variation on the Sko- 
rokhod space can be given a natural symplectic structure; (ii) for every convex, 

flux / (1.2) induces a Hamiltonian flow on this space with respect to this 
symplectic structure; (iii) these flows commute for distinct /. A precise formula- 
tion of these ideas is subtle, and we have been unable to develop this viewpoint 
completely, even though we obtain partial results. In order to state precisely 
what we prove, and what is mere conjecture, we first review some recent work. 

1.2 Lax equations for shock clustering with McU"kov data 

We always assume that random initial data uq for (1.2) is a Markov process in 
X with only downward jumps (a spectrally negative Markov process) or a limit of 
such processes. This assumption is motivated by two considerations. First, in 
order to obtain a detailed understanding of the evolution of the law of Uq under 
(1.2), it is necessary to work with a class of well-understood random processes on 
the line and it is natural to choose Markov processes. Second, it is a surprising 
fact that for every t > the solution to Burgers equation with white noise is 
a stationary, spectrally negative Markov process in x. The Markov property of 
this solution was assumed by Burgers, and first proved (in a completely different 
context) by Groeneboom [29]. The importance of the Markov property in the 
context of Burgers turbulence was first noted by Avallaneda and E [11]. 

In a recent article, we proved the following closure theorem for the entropy 
solution to (1.2): Assume the initial data Uo{x) is a spectrally negative strong 
Markov process in x. Then for every t > the entropy solution to (1.2) remains 
a spectrally negative Markov process in x [38, Thms.2,3]. This shows that 
the entropy solution to (1.2) leaves this class of stochastic processes invariant. 
It is not necessary to assume that /(w) = only that / is convex and 

C^. There is also no need to assume that uo is stationary in x. Under an 
additional assumption of regularity (preservation of the Feller property), the 
closure theorem forms the basis for a kinetic theory of shock clustering as follows. 

Feller processes are characterized by their generators. The simplest such 
characterization is tlic Levy-Khintchine formula for Levy processes. A general 
characterization, attributed to Courrege in [8, Thm 3.5.3] builds on the Levy- 
Khintchine formula. Assume u{x), x gM. is a. stationary. Feller process. Then 
its generator A is an integro-differential operator that acts on test functions 
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in its domain as follows: 



A<p{u) = a{u)(p"{u) + h{u)ip'{u) + c{u)tp{u) (1.3) 

+ / (V'(w) — ip{u) — ip{u, v)ip'{u)) n{u, dv). 
Jm\{u} 

Here a,b,c arc functions on the line satisfying certain continuity criterion, n{u, dv) 
is a Levy measure that describes the jumps of the process, and ip{'>^j is a local 
unit (in the simplest situation, we have iIj{u,v) = v — u). The precise assump- 
tions are stated in [8, Thm. 3.5.3]. There is an intimate relation between this 
characterization and the sample paths of the Feller process: a describes the 
diffusion of the Feller process, so that a > 0; c describes killing, so that c < 0; 
b describes the drift, and its sign is not restricted. 

Our basic idea, following [21], is to study the shock statistics through an evo- 
lution equation for the generator. The general formula (1.3) simplifies because 
for any t > 0, the entropy solution to (1.2) has bounded variation, only down- 
ward jumps, and no killing. Thus a and c vanish, and the support of n{u, dv) 
is (— oo,u). For fixed f > 0, if u{x,t), a; G M is a stationary Feller process, its 
generator A{t) is an integro-diff'erential operator of the form 

A{t)(p{u) ^ b{u,t)ip'{u) + [ {ip{v) - ip{u))n{u,dv,t). (1.4) 



One of the main results in [38] is that A{t) satisfies the Lax equation 

dtA=[A,B]. (1.5) 

Here [A, B] = AB — BA denotes the Lie bracket, and the operator B is defined 
by its action on test functions as follows: 

Btp{u) = -f'{u)b{u,t)tp'{u) - [f]u,v{v{v) - 'p{u))n{u,dv,t). (1.6) 

J — oo 

[f]u,v is abbreviated notation for the Rankine-Hugoniot speed of a shock con- 
necting states u and v. 

We do not need to assume that the process u{-,t) is stationary in x. Non- 
stationary data arise when we wish to model the spread of 'turbulent bursts' 
(i.e. localized data Ao{x)) or Riemann data (e.g one-sided initial conditions as 
in [46]). These situations are described by the zero-curvature equation 

dtA-d,B=[A,B]. (1.8) 

It requires considerable insight to realize that this approach is fruitful, and 
our work was greatly inspired by Duchon and co-workers [19, 20, 21]. Several 
open questions remain: In particular, it is still necessary to justify the connection 
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between (1.2) with random data and (1.8) in full generality [38, §1.6]. Our 
earlier work only showed that the entropy solution preserves the strong Markov 
property, whereas we really need to show that it preserves the Feller property. 
This requires a well-posedness theory and uniform estimates for (1.8). One of 
our goals in this article is to lay a foundation for the analysis of (1.8) that also 
allows us to address this question. 



1.3 Kinetic equations 

To convince the reader of the merit of this approach, let us briefly explain how it 
describes the evolution of shock statistics. Since the generator A is characterized 
by the drift, 6, and jump measure, n, the Lax equation can be expanded using 
(1.4) and (1.6) to yield evolution equations for b and n. The drift satisfies the 
simple differential equation 

dtb{u,t) = -f"{u)b\u,t). (1.9) 

The evolution of n is more interesting, since the shock statistics evolve by de- 
cay of rarefaction waves and growth in binary collisions. To state the evolu- 
tion equations, we assume for simplicity that the jump measure has a density, 
say n{u,dv,t) = n{u,v,t) dv. Then n satisfies a kinetic equation of Vlasov- 
Boltzmann type 

dtn{u,v,t) + du {nVu{u,v,t)) + {nVv{u,v,t)) (1-10) 
= Q(n, n)+n (([/]„,„ - f'{u)) dub - bf"{u)) . 

Here the drift velocities Vu and Vy are defined by 

Vu{u,v,t) = ([/]„,„ - f'{u))b{u,t), V4u,v,t) = {[fUv - f'{v))biv,t), 

(1.11) 

and the collision kernel Q counts growth and loss in binary clustering 



Q{n,n){u,v,t) = / {[f\u,w -[f\w,v)n{u,w,t)n{w,v,t)dw 

J V 

{[f]u,v - [f]v,w) n{u, V, t)n{v, w, t) dw 
i[f]u,w - [f]u,v) n{u, V, t)n{u, w, t) dw. (1.12) 



/ 

J — ( 

-/ 

J — c 

The distinction with earlier work on kinetics of shock clustering [24, 34] is sum- 
marized in [38]. 



1.4 Exact solutions 

Despite their formidable appearance, 
exact solutions for Burgers equation, 
to spectrally negative Levy processes 



these kinetic equations admit surprising 
The first class of solutions correspond 
(e.g. when uq is a Brownian motion as 
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in [44]). Wc then have Ae^y = tp{q,t)e'^y and [A,B]eiy = -tpdqtp{q,t), where 
'4'{q,t) denotes the Laplace exponent of the process u{x,t) — u{0,t), x > 0, for 
fixed t > 0. Then the Lax equation (1.5) yields 

dtip + ipdqij^O, t>0,q>0. (1.13) 

The beautiful fact that the Laplace exponent itself evolves by Burgers equation 
was discovered by Carraro and Duchon [19], and made rigorous by Bertoin [16]. 
It is reminesccnt of the inverse scattering method in intcgrablc systems. In 
addition, (1.10) reduces to Smoluchowski's coagulation equations with addi- 
tive kernel. It has been known for some time that Smoluchowski's equation 
can be solved explicitly by the Laplace transform [28]. But this solution takes 
on new meaning when we recognize that it describes exactly the clustering of 
shocks in Burgers equations with Levy process data. This connection is the 
basis for sciveral deeper results connecting stochastic coalescence and Burgers 
turbulence [17]. 

Another remarkable solution to (1.5) corresponds to the shock statistics in 
Burgers equation with white noise initial data [27, 29]. This corresponds to a 
self-similar solution to (1.9) and (1.10) of the form 

b{u,t) = ^, n{u,v,t) = -^n^{ut^'^,vt^'^). (1.14) 
The jump density n* of the integral operator is given explicitly as follows: 

nJu,v) = '^}'^\ k[u — v), u>v, (1-15) 
J{u) 

and vanishes if u> v. Here J and K are positive functions defined on the line 
and positive half-line respectively, whose Laplace transforms 



/OO pOC 
e-'iyj{y)dy, k{q) ^ e-^yKiy)dy, (1.16) 
OO Jo 



are meromorphic functions on C given by 



M = TT7T^, i;(j) = -2— logAi(,). (1.17) 



Ai denotes the Airy function [4, 10.4]. This generator was computed by Groene- 
boom (but not as a solution to (1-5)!) [29]. When written this way, the formula 
for k is reminescent of determinantal formulas in soliton theory. 



1.5 Discrete LcLX equations and the Markov A^-wave model 

The most intriguing aspect of [38] is that the kinetic theory of shock clustering 
has many features reminiscent of completely integrable systems. These include 
the formulation as a Lax pair, explicit exact solutions as above, a Painleve prop- 
erty for the self-similar solution, and links with random matrix theory (see [38, 
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§1.5.1]). Our main contribution here is to understand the origin of these coin- 
cidences. 

In order to show that (1.5) is completely integrable, we must show that it de- 
fines a Hamiltonian system and construct infinitely many commuting integrals. 
In addition, what is required is an explicit solution via inverse scattering or a 
Riemann-Hilbert problem. In order to address these questions, we first study 
exact finitc-dimcnsional discretizations of (1.5) (this terminology is explained 
below) . Here the definition of Hamiltonian structure and complete integrability 
are unambiguous, and it has to be shown that (1.5) actually has this structure. 
This lays a foundation for the analysis of (1.5) and (1.8). 

We discretize the system as follows. Fix a positive integer N and discrete 
velocities — oo < mi < W2 < • • • < mjv < oo. We consider a continuous x 
Markov processes that takes the values Uk, 1 < k < N. The sample paths of 
this Markov process are piecewise constant paths. Let Mat denote the space of 
N X N matrices. The generator A e M.^ satisfies 

N 

Aij>0,iy^j, ^^i,=0. (1.18) 

i=i 

More formally, A corresponds to an operator A with drift 6 = and a jump 
measure n{uj, dv) = ^k=^j AjkSu^idz). The jump measure vanishes if u is not 
one of the discrete states uj. In order to define the discretization B of B we 
introduce the symmetric matrix 

p,__fM^iMj^j, F,, = -f'iui). (1.19) 

Here / is the flux in the conservation law (1.2), so that —Fij is the Rankine- 
Hugoniot speed of the shock connecting states Ui and Uj. We then set 

Bij = FijAij, i j, Bu = -Y^ Bij. (1.20) 

The discrete zero-curvature equation is 

dtA-d,B = [A,B], (1.21) 
and the discrete Lax equation is 

A=[A,B]. (1.22) 

Let us briefly comment on why this discretization is natural. Initial data to 
(1.21) correspond to random initial data uq to (1.2). In particular, initial data 
to (1.21) that are lower triangular corresponds to wq that are spectrally negative 
Markov processes with a finite number of velocities. For such initial data, the 
Feller property is preserved in time and the civolution of statistics of random 
initial data by the conservation law (1.2) is described exactly by (1-21). Thus, 
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solutions to (1.21) that are lower triangular and generators of Markov processes 
correspond to exact solutions to (1.8). This is what we mean when we say that 
the discretization is exact. 

When wo consider (1.22), complete integrability immediately becomes plau- 
sible. Indeed, ordinary differential equations of the form (1.22) arise in basic 
examples in the theory of integrable systems: these include geodesic flows on 
SO{N) [36], the integrable flows of Neumann and Jacobi [39], and the inte- 
grable flows of random matrix theory [31]. Moreover, the zero-curvature equa- 
tions (1.21) are very similar to the A/'- wave model [3, 48] with an important 
difl^erence. For the TV-wave model we typically assume A E u, the algebra of 
the unitary group. (Perhaps the only study where A is not assumed to lie in 
u is [1]). But A e u is incompatible with the Markov property. Motivated by 
these considerations, we call (1.21) the Markov N-wave model, or mA/'-wave 
model for short. 

1.6 Statement of results 

A complete study of (1.5) and (1.8) requires a combination of methods from 
the; theory of Markov processes, integrable systems and spectral theory. Our 
approach is to prove complete results in the finite-dimensional setting in this 
article, and to study the limit iV oo in a sequel. Once one has recognized the 
structure of the problem in the discrete setting, the proofs only rely on well- 
established techniques from integrable systems. (Of course, the main difficulty 
in our work was to recognize this structure!). Our main results are: 

1. (1.22) is Hamiltonian and is associated to a principle of least action (see 
Section 2 and Theorem 1). 

2. (1.22) is completely integrable and linearized by a loop-group factorization 
(see Section 3 and Theorem 5). 

3. an inverse scattering theory and well-posedness for (1.21) (see Section 4, 
in particular Theorem 11). 

Though obviously related, the method of solution for (1.5) and (1.8) turn out to 
be quite distinct. The Lax equation (1.5) can be studied through an operator 
factorization problem, while the zero-curvature equation (1.8) can be attacked 
by inverse scattering. 

Let us briefly explain how these results are proved. Hamiltonian structures 
and complete integrability are deeply linked to group actions. In seeking an 
algebraic, but probabilistically natural, approach to (1.5), we find the follow- 
ing simple structure. We observed in [38, §2.7] that the space trioo of integro- 
differential operators C of the form 

Cviu) = I3{u)^'{u)+ [ {ip{v)-^{u))v{u,v)dv, /3GC~(M),J/eC,°°(M2), 

(1.23) 
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formally constitutes a Lie algebra. Of course, it is not clear that these operators 
generate a infinite-dimensional Lie group. But we may use this insight at the 
discrete level quite easily. The set of generators of Markov processes defined by 
(1.18) forms a cone q^v C Mat. Each element of qjv generates a one-parameter 
Markov semigroup. Let us define 

Qn = {9€GL{N,R)\ = e^^, ^ e qjv, a; > 0}. (1.24) 

0Ar can be naturally embedded in a Lie group as follows. Define the Markov 
algebra 

N 

mjv = {^GMjv| ^Aij = Q, l<i<N}. (1.25) 

mjv is a Lie algebra. It generates the Markov group 

TIn = {g € GLiN,R)\ g = e='^, A e mjv, x e M.}, (1.26) 

Since N will be fixed in our results, we will mostly suppress the subscript N in 
what follows. Clearly q C m, and Q C 9Jl. Some basic properties of m and 9Jl 
may be foimd in [32]. 

The first result is that (1.22) is a Hamiltonian flow on m. In addition, when 
/ is convex and A lower triangular, this Hamiltonian flow leaves q invariant. 
These results are seen as follows. Co-adjoint orbits of Lie groups carry a natural 
(Kirillov-Kostant) symplectic structure. In addition. Lie algebras that admit 
direct sum decompositions into subalgebras carry more than one symplectic 
structure. This idea has been formalized by the notion of an r-matrix [42]. 
The Lie algebra q = gl{N,M.) = Mjv admits a natural splitting induced by m 
(see (2.2) below). We show that (1.22) is a Hamiltonian system on the algebra 
with a Lie-Poisson bracket induced by this splitting. This is stated precisely 
in Theorem 1 below. Once we have established the Hamiltonian structure of 
(1.22) we treat the probabilistically important case {A G q) in Theorem 3. The 
proof shows clearly the role of convexity of / and spectral negativity at the level 
of the Lax equations. 

We then formulate an associated principle of least action for (1.22). This is 
in precise analogy with geodesic flow on SO{N) and in particular, with Euler's 
equation for a free rigid body. When Fij is positive, the role of the flux / is to 
deflne a (degenerate) metric on dJl. More generally, / defines a quadratic action 
through the multiplier F^j . This principle of least action describes the evolution 
of a probability measure on path space through (1.22), and is completely distinct 
from the usual principle of least action for the Hopf-Lax solution to (1.2). 

Complete integrability is based on an elegant observation of Manakov [36]. 
Define the diagonal matrices 

= diag(?ii,...,?iAr), AA = diag(/(ui),...,/(u7v)), (1-27) 

and observe that A and B are related through the algebraic relation 

[A,Af\-[M,B]=0. (1.28) 
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This allows us to introduce a spectral parameter ^; S C in (1.22) and embed the 
flow in a loop-algebra 



Thus, the spectral curve {{z, A) G C^| det(yl — zM — XT) = 0} is invariant, and 
its coefficients are integrals. Rather than verify explicitly that these integrals 
arc in involution as in [36], it is simpler to apply the Adler-Kostant-Symes 
(AKS) theorem to show that (1.29) is completely integrable. Again we need 
to find a suitable r-matrix, now of the loop algebra. This is stated precisely in 
Theorem 5. The matrix factorization in the AKS theorem also yields a Riemann- 
Hilbert problem. Once this is formulated for finite N, it also yields a limiting 
factorization problem for the Lax equation (1.5). 

Scattering and inverse scattering for a class of integrable systems including 
the TV-wave model on u was established by Beals and Coifman [13, 14] and 
formulated in a general Lie algebraic setting by Beals and Sattinger [15]. This 
theory does not directly apply to (1.21) since m is not semi-simple. We mod- 
ify [13, 14] to obtain a scattering and inverse scattering theory for (1.21). Among 
other results, we obtain a hierarchy of integrable flows and global well-posedness 
theorems for (1.21) including the probabilistically natural case. These results 
are contained in Section 4. In addition to these rigorous results, the method 
also yields a formal solution procedure for (1.8) via inverse scattering. 

In the limit N — > OC', A is an intcgro-differential operator of the form (1.4). 
Subtle problems arise in the approximation: under suitable assumptions, an 
operator of the form (1.3) can be approximated by operators in rrijv. However, 
operators of the general form (1.3) are not closed midcr the commutator! In 
addition, while operators of the form (1.23) form a Lie algebra, it is not clear 
that they correspond to an infinite-dimensional Lie group. Nevertheless, our 
work yields a formal understanding of (1.5) and (1.8), and thus (1.2) with spec- 
trally negative Markov data. Theorem 1 suggests that (1.5) is Hamiltonian with 
an associated principle of least action on the semigroup of Markov operators. 
Theorem 5 suggests that (1.5) is completely integrable, and yields an operator 
factorization problem for (1.5). Finally, the inverse scattering problem for (1.8) 
extends formally to unbounded operators with little change. We hope that these 
natural conjectures will stimulate rigorous results of full generality. 

2 Hamiltonian structure 

In this section, we show that the Lax equation (1.22) defines a Hamiltonian 
system. As is well-known, co-adjoint orbits of a Lie group carry a natural 
symplectic structure [7, 10]. If g denotes a finite-dimensional Lie algebra, and H 
a smooth Hamiltonian defined on the dual space g*, then Hamilton's equations 
may be rewritten as Kirillov's equation 



— {A-zM) = [A-zM,B + zAf\, zeC. 



(1.29) 



a = addH(a)(a), 



a e g*. 



(2.1) 
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Our main observation is that the Lax equation (1.22) takes this form on the 
algebra g with a probabilistically natural bracket defined in (2.5) below. 

2.1 Algebraic preliminaries 

Let Q = gl{N, R) denote the Lie algebra of real, N x N matrices equipped with 
the bracket [A, B] = AB — BA. We have already defined the subspace m C 
in (1.25). It is easily checked that m is a subalgebra of g. Let D denote the 
subspace of diagonal matrices. Since diagonal matrices commute, is trivially 
a subalgebra. Its importance here lies in the fact that Q admits a direct sum 
(vector space) decomposition 

= me(). (2.2) 

The decomposition (2.2) is obtained as follows. Recall that e = (1, 1, . . . , 1)^. 
Given A G q, we define the projections 

PmA = A-di&g{Ae), PjA = diag(Ae). (2.3) 

Then = Pm, P^ = A, Pm-^ G m and P^A e 5, and every matrix A G Q may 
be written as 

A = P^A + P^A. (2.4) 

Associated to a splitting such as (2.2) is an r-matrix [42]. This allows us to 
introduce a new Lie bracket on q. li A,B g g, then we define a new ad-action 

ad^jA = [S, A]r = [P^B, P„,A] - [P,B, P,A] = [P^B, P„,A]. (2.5) 

The last equality holds because diagonal matrices commute, g remains a Lie 
algebra with the new bracket [•, -Jr. 

We identify g with its dual space g* through the non-degenerate Ad-invariant 
pairing 

{a, A) = Tr(aA), a G g*, A G g. (2.6) 

The dual spaces m* and 5* are naturally identified with the orthogonal comple- 
ments J)""" and m""" under (■, ■). It is easy to compute 

m* ^ 5-^ = {a\ diag(a) = 0}. (2.7) 

Since Ae = for every A Gm, and the dimension of m is iV^ — N we also find 

N 

D*^m^ = {a\a = J2 ^jEj = 0}, (2.8) 

where Cj G R, 1 < j < A'^ and Ej is the matrix obtained from the zero matrix 
by replacing the j-th column with e. The projections Pm and Pa induce dual 
projections Pm± and Pj)± in the natural manner: for A G g and a G g* 

iP^^a,A) = {a,PraA), {P^^a,A) = {a,P^A). (2.9) 
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Wc now compute the ad* action with the bracket (2.5) and the non-degenerate 
pairing (2.6). For every A, B E g and a G g* wc have 

adgaiA) = (a,adB^) = {a, [P„,B,P„,A]) 

= Tr (a[P„,B, P„,A]) = Tr {[a, P„,B]P^A) 
= {[a,P^B],P„A) = {P^4a,P,nB],A). 

Since this holds for every A G g, we find 

s^d%'a = P^4a,Pr„B]. (2.10) 

2.2 Quadratic Hamiltonians and Kirillov's equation 

The calculation so far has been purely algebraic and we have been careful to 
distinguish a E g* from A E g. Since we have now computed the ad*'" action 
and has been identified with g* via (, ) we may drop this notation. We assume 
H : g^Ris C^, replace a by A and B by dH{A) in (2.10) to obtain Kirillov's 
equation with the bracket (2.5) 

A = ad*/H^^)A = P,^ [A, PmdH{A)] (2.11) 

These calculations have made precise the Hamiltonian structure. We now show 
that the Lax equation (1.22) is Hamiltonian with this symplectic structure. 

Theorem 1. Assume given a real symmetric matrix F, and let F o A denote 
the Hadamard product {F o A)ij = FijAij. Define the quadratic Hamiltonian 
H:g^R 

H{A) = -Tr{AF o ^) = 2 E Pii^iiAii. (2.12) 

(a) The associated Hamiltonian vector field on g with the bracket (2.5) is 

A=[A,P,nFoA]. (2.13) 

(b) rw^ , , m, and c) are invariant under (2.13). 

(c) When F is defined by (1.19), the vector field (2.13) is identical to (1.22). 

Proof, (a) Kirillov's equation on g* with the bracket (2.5) takes the form (2.11). 
We only need to show that 

P^4A,P^dH{A)] ^ [A,P„,dH{A)] = [A,PrnFoA]. (2.14) 

The identity (2.14) is seen as follows. By (2.12), dH{A) = F o A and 

N 

[A, dH{A)]ij = J2 (Pkj - F^k) AikAkj. (2.15) 
fe=i 
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Since F is symmetric, [A, dH{A)] vanishes on the diagonaL In addition, since 
PT,dH{A) is a diagonal matrix, [A,Pj,dH{A)] also vanishes on the diagonal. 
Thus, by (2.7) 

P,4A,dH{A)] = [A,dH{A)], P^4A,P^dH{A)] = [A,P^dH{A)], 

which implies (2.14). 

(b) Every vector field of the form (2.11) vanishes on rti"'". Thus, all such 
vector fields (not just quadratic Hamiltonians) leave d-^ and invariant. If 
A then PmdH{A) vanishes. Thus, D is invariant. Next, m is a subalgebra. 
So if A e m, then [A, PmdH{A)] e m and m is also invariant. 

□ 

Remark 2. The following subtlety should be noted in (b). and m-'- are 
Poisson subspaces, but 5 and m are not. A subspace F C is a Poisson subspace 
if and only if the restriction of every Hamiltonian vector field on to ^ is 
tangent to V [7, Prop. 3.33]. However, it is only vector fields corresponding to 
the quadratic Hamiltonians (2.12) that vanish on and m. 

2.3 Convexity of / and spectral negativity 

We defined the Markov group in order to give the Lax equations (1.22) the 
Hamiltonian structure stated in Theorem 1. We now return to the probabilis- 
tically interesting case. We consider the initial value problem for (1.22) with 
initial data that are generators of spectrally negative Markov processes with 
discrete states — oo < ui < . . . < ujm < oo. In this discrete setting, spectral 
negativity simply means that the generator is a lower-triangular matrix. For 
brevity, let I denote the algebra of lower-triangular matrices. We then have 

Theorem 3. Assume f is a convex flux and F is defined by (1.19). Then 

(a) The vector field (1.22) leaves i invariant. 

(h) The vector field (1.22) leaves q n ( positively invariant. 

(c) For every G q fl [, there is unique, global, C°° solution A : [0, cxd) with 
initial condition A{Qi) = Aq. 

Proof, (a) is easy. If ^ e [, then B e I and [A, B] G [. (This does not require 

convexity of /). 

(b) Since we have already shown that the lower-triangular form is preserved, 
we only need to show that the flow preserves positivity. In order to establish 
positive invariancc, we show that the vector field (1.22) 'points into' q n [ for 
a point on its boundary. A point on the boundary has A e q PI t and Aij = 
for some indices {i,j) in the lower-triangular region (i.e i > j). We claim that 
Aij > so that positivity is preserved. Indeed, by (1.22) 

= H i^kj - Fik) AikAkj + (An - Ajj)Bij + {Bjj - Bu) Aij. (2.16) 
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The second and third term vanish since Aij = and Bij = FijAij — 0. We now 
consider the first term. Since A is lower-triangular, the sum extends only over 
j < k < i. For such k, uj <Uk < Ui, and the convexity of / and (1.19) imply 

i'kj - i'ik = > U. (^•J-') 

Ui -Uk Ufe - Uj 

Finally, since ^ G q fl [ we also have Aik > and Akj > 0. Thus, A^j > 0. 

(c) The vector field (1.22) vanishes on the diagonal, so that the diagonal terms 
are conserved. The diagonal terms control the off-diagonal terms since A G q. 
Thus, we have the uniform bound 

Q<Aij{t)<\Aii{t)\ = \Aum, i>j,t>0. (2.18) 

□ 

Remark 4. As in Remark 2, [ constitutes an invariant subspace, but not a 
Poisson subspace. In fact, if we set Fa = (which does not affect the flow), 
we see that the Hamiltonian H = Tr{AF o A)/2 vanishes on the subalgebra of 
lower triangular Markov matrices. 



2.4 A principle of least action 

In this section, we assume that Fij does not vanish. Equation (1.20) then defines 

an endomorphism of m, and we have the following principle of least action. Let 
£( : [0, 1] — )■ denote a path with given endpoints. Assume F^j ^ 0, define 
the endomorphism I : m ^ xn, B A given by (1.20), and define the action 

s[g] = 11't^^ {I (g-'g) g~'g) dt. (2.19) 

Then the Euler-Lagrange equations for minimizing this action are 

A=[A,B], g = gB. (2.20) 

The second equation is a linear non-autonomous equation and can be integrated 
once we have solved the first, which is of course, (1.22). 

It is easy to check this assertion, but the main point to observe is that when 
Fij > 0, a similar endomorphism of so{N) is used to define geodesic flow with 
respect to a left-invariant metric on SO{N). Let us first recall these ideas. This 
will immediately explain the origin of (2.19). 

We follow the notation of [7, Ch.8, p. 265] (see also [9, Appendix 2]). Assume 
F is a symmetric matrix with strictly positive entries and let I : so{N) — >■ so{N) 
denote the endomorphism oj i-^ I{ui) with I{co)ij = cOij/Fij. In physical terms, 
oj is the angular momentum in the body frame and X = I{ui) is the angular 
momentum in the body frame. The letter / stands for the inertia tensor. Since 
Fij > and so{N) is semi-simple, the quadratic form 

iuJi,uj2)p := Tr (/(wi)w2) , Wi, G So{N) (2.21) 
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is an inner-product on so{N). The inner-produet {■,-)f then defines a left- 
invariant metric on SO{N) by left-translation. The length of a path g : 
[0, 1] — )• SO{N) with respect to this metric is given by 

m = ^]{9-^9, 9-'9)Fdt. (2.22) 

The problem of minimizing the length is the same as that of minimizing the 
action ^ 

^^9] = ll {9-'9,9-'9)pdt. (2.23) 

The Euler-Lagrange equations for this variational principle are precisely the 
Euler equations: 

X=[X,L0], g = guj. (2.24) 

If we replace the group SO{N) with the Markov group DJl, the angular momen- 
tum X with A, and the angular velocity lo with then wc have a flow on DJl 
given by precisely (2.20). The analogy with geodesic flow on so(A^) is now clear. 

In the calculations above, we did not need to assume that Fij is of the 
form (1.19). Indeed, every symmetric matrix F with positive entries defines 
a diagonal left- invariant metric on SO{N). Geodesic flow with this metric is 
Hamiltonian, but not necessarily completely integrable for N > 4:. However, 
Manakov discovered that when 

Fii = (2.25) 

Ui - Uj 

for two vectors (/i, . • . , /jv), and (ui, . . . , u^), the geodesic flow is integrable. 
It is quite remarkable that we find ourselves in exactly this situation with the 
flux function / in (1.2) defining F as in (1.19). 

The analogy with geodesic flow is incomplete in the following respect. First, 
we do not assume that Fij > 0. Moreover, even when Fij > 0, the metric 
on m is degenerate since m is not semi-simple. (The Killing form oi A, B G Q 
is 2NTt{AB) - 2Tt{A)Tt{B). This vanishes on the identity in q, and on m if 
^ = ELi CjEj with J2f=i Cj = 0.) 



3 Complete integrability 

Complete integrability of all the systems alluded to in Section 1.1 can be es- 
tablished in the unified framework of [5, 6, 40]. There are two distinct aspects 
to these studies: the first is to establish complete integrability via a suitable 
loop algebra splitting. The second is to explicitly linearize the flow on a Jacobi 
variety. Here we only consider the first aspect of the problem. We show that 
the Lax equation (1.22) defines a completely integrable Hamiltonian system. 
The proof is almost a textbook application of the Adler-Kostant-Symes (AKS) 
theorem and we follow the treatment in [7, §4.4]. Construction of the linearizing 
transformation is more difficult and will be considered in a separate article. 
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3.1 Integrability via the AKS theorem 

We introduce the loop algebra of formal finite Laurent expansions valued in q 



L{2) = {X{z) = ^Akz'',m,n& 1 Ak & q}. 

m 

The natural Lie bracket on L{q) is given by 



(3.1) 







E [^-^^] ) 






\i+j=k J 



(3.2) 



The sum inchidcs only a finite number of terms by (3.1). We pair g with g* 
via the non-degenerate, Ad-invariant pairing (2.6). There are then various Ad- 
invariant pairings that one may introduce on L{q). We use the pairing 



{X\Y) = ^ {X„Yj) = _ Tv{X{z)Yiz)) 

i+j=0 



dz 
z 



(3.3) 



It is easily checked that this pairing is non-degenerate and Ad-invariant. 

The direct sum decomposition (2.2) also induces a decomposition of L{g). 
We define the subalgebras 



L{q)+ = {X{z) = Akz^, ^0 e m, Ak&Q,k> 1}, 

fe>0 

L(b)_ = {X{z) = Y,Akz\ ^0 6 0, AfeeB,A;<-l}. 



fe<0 



(3.4) 

(3.5) 

(3.6) 
(3.7) 



It is immediate from the calculations of Section 2.1 that 

l{q) = l{q)+®l{q)_. 

The respective projections are given by 

X{z)+ = P^Ao + X{z)- = PaAo + ^kz''. 

fe>i fc<-i 

The orthogonal complements with respect to the pairing (3.3) are given by 

L{q)1 = {Y{z) = Y,Ykz\ Foem^, Ffcefl,fc>l}, (3.8) 

fe>0 

l{q)^ = {r(z) = ^yfc2^ yoeo^,nGfl,A;<-i}. (3.9) 

fe<0 

The gradient of a function H : L{q) — >■ C is defined through the pairing (3.3). 
For X,Y € L{q) 

{VH{X)\Y) = ^H{X + Ty)|,=o. (3.10) 
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Hamiltonian flows on L{q) correspond to the Lax equation 

X = [VH{X),X] (3.11) 

If H is Ad-invariant then the vector-field (3.10) vanishes. On the other hand, 

Ad-invariant Hamiltonians define non-trivial vector fields through the r-matrix 
induced by the splitting (3.6). By the Adler-Kostant-Symes theorem, these 
vector fields correspond to the Lax equation 

X = ±[X,VH{X)^]. (3.12) 

We now show that (1.22) is of the form (3.12) for a suitable Ad-invariant 
Hamiltonian on L{q). Let / denote the flux in the scalar conservation law (1.2), 
and consider its antiderivative 

Fis) = / fir)dr. (3.13) 
Jo 

Define the Hamiltonian Hp ■ L{q) — >■ C 

Hf{X{z)) = {F{X{z)z-^)\z^) = ^ (f F{X{z)z-')zdz. (3.14) 

^•"■i J|z| = l 

If / is a polynomial, the second equality follows from Cauchy's theorem. The 
general case follows by approximation. The Hamiltonian Hp is distinct from 
the Hamiltonian of Theorem 1, and in some sense is more natural. A few 
calculations (see [7, p. 94]) then yield that Hp is Ad-invariant and 

VHf{X) = f{Xz-^)z. (3.15) 

Now consider the diagonal matrices M. and M as in (1.27), and consider the 
finite-dimensional subspace V of L{q)^ consisting of linear polynomials of the 
form 

V = {XeL{Q)+\X{z) = zM-A). (3.16) 
A direct computation based on the definition of Hp then yields 

{VHf{X))^ = zN +B (3.17) 

where B is as in (1.20). Thus, the Hamiltonian flow deflned by Hp on V is 
exactly (1.29), which is of course, identical to (1-22). 

Wc now see that every flux / gives rise to a Hamiltonian flow as in (1.29). 
Since each of these Hamiltonians is Ad-invariant, they are all in involution. One 
may now count the number of integrals and invoke the Liouville theorem. Here 
we linearize the flow via the AKS theorem. 

Theorem 5. Let £ m and ^0(2;) = zM — A^. Let g±{t) denote the smooth 
curves in Q± which solve the factorization problem 

exp(-iVifF(^o)) = g+{t)-'g-{t), g±{0) = Id, (3.18) 
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for t in a maximal open interval I containing 0. Then the solution Xf, t €: I to 
(1.29) with initial condition Xq is given by 

Xt = Adg^^t)Xo = Adg_^t)Xo. (3.19) 

Note that a solution to the factorization problem always exists for \t\ small, 
thus a maximal interval of existence for a smooth solution to the factorization 
problem is well-defined. It is also well-known that this factorization problem is 
equivalent to a Riemann-Hilbert problem (see [12, Ch. 3.5]). 

3.2 An operator factorization problem 

While Theorem 5 apphes only to the discrete Lax equations (1.22), one may 
easily guess the associated factorization problem for (1.5). Recall that the gen- 
erators A and B are integro-differcntial operators defined by (1.4) and (1.6). As 
N oo, the N X N diagonal matrices Ai and TV of Theorem 5 are replaced by 
multiplication operators that act on test functions via 

M^piu) = uifiu), Mif{u) = f{u)Lp{u). (3.20) 

The crucial algebraic relation (1.28) continues to hold. 

[A,Ar\-[M,B]=Q. (3.21) 

Formally, this is all that is required to embed (1.5) in a loop-group and we now 
find the factorization problem 

exp(-tVFF(Xo)) = g+{t)-^g-{t), g±{0) = Id, (3.22) 

with Xo = zM. — A. Rather than develop these ideas in formal generality, let 

us mention one interesting example. Assume wc consider Burgers equation with 
Brownian motion initial data. Then the Hamiltonian is Hf{s) = s^/6 and the 
generator of initial data is ■4o<p(w) = —ip"{u)/2. Then we find 

^HAXo) = l{zM-l^)\ (3.23) 

For 2; e M, this is the square of the Airy operator. To the best of our knowledge, 
the factorization suggested by (3.22) is new. 

4 Scattering and inverse scattering theory 

In this section we develop a scattering and inverse scattering theory for the 
discrete zero-curvature equations (1.21). 

dtA-d,,B =[A,B]. 
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The linear problem that underlies the scattering theory of (1.21) is as follows. 
Assume that M is a, fixed diagonal matrix as in (1.27) and A G L^(Ili, m). We 
consider a fundamental matrix for the linear equation 

= iP {zM + A) , xeR, (4.1) 

such that ip{x, z) ~ e^^^ as a; — >■ — oo. Such solutions are called wavefunctions 
(wc use the terminology of [45] ) . The scattering theory for this equation when A 
is a matrix that vanishes on the diagonal was considered by Zakharov et al [48] 
and by Beals and Coifman [13, 14]. In our work, ^ e m. As a consequence, even 
though A is completely determined by its ofF-diagonal elements, its diagonal en- 
tries do not vanish. Thus, the scattering theory of [13, 14] does not immediately 
apply to our model and we have to rederive some results. For the most part, 
this is straightforward. To prevent too much repetition, wc state the results we 
need, and present the main calculations that explain how the results of [13, 14] 
are to be modified in Section 4.5 and Section 4.6. 

A short outline of the results of this section is as follows. The scattering 
theory is addressed in Section 4.1. Theorem 6 and Theorem 7 associate spectral 
data to A e i^(M, m). We consider the inverse scattering theory in Section 4.2 
and Theorem 8. The time evolution of spectral data and the Cauchy problem 
is considered in Section 4.3. The linear evolution of spectral data is stated 
in (4.11). The combination of inverse scattering theory and evolution yields 
several well-posedncss theorems for (1.21). Finally, wc construct a hierarchy of 
integrable models as in the ZS-AKNS hierarchy in Section 4.4. It is not apparent 
to us that these have intrinsic probabilistic significance, but it is interesting to 
note that one may construct other integrable flows on 9Jt with little effort. 

All results are rigorous ioi N x N matrices and have a natural, but formal, 
extension to the integro-differential operators A and B. The linear evolution of 
the spectral data for these operators remains (4.11) with M and M replaced by 
the multiplication operators (3.20). However, we are unaware of rigorous results 
on the inverse spectral problem for such operators, and a full well-posedness 
theorem for (1.8) via inverse scattering requires further study. 

4.1 Scattering theory- 
It is more convenient to work with the new variable 

m(x,z) = e-^-^="V(a;,^). (4.2) 

m satisfies the linear equation 

nix = z[m,M] + mA (4.3) 

Solutions to (4.3) such that ]]m(-, z)]]/,oo(k) < oo and m{x,z) — / as x — ;> — oo 
are called global reduced wave functions. 

Theorem 6 and Theorem 7 below closely follow Beals and Coifman [13]. Let 
E = denote the imaginary axis in the complex 2;-plane, and let P denote the 
set of maps A G L^(]R, m). We call these maps potentials and the subclass Pq 
below generic potentials. 
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Theorem 6. [13, Thm. A] (a) Suppose A G P. There is a bounded discrete set 
Z C C\S such that m{-,z) is a unique global reduced wave function for every 
z e C\(S U Z). Moreover, m{x, •) is meromorphic in with poles precisely 

at the points of Z and lim^^oo rii{x, z) = I. 

(b) There is a dense open set Pq C P such that if A gPq then 

1. Z is finite. 

2. The poles of m{x, ■) are simple. 

3. Distinct columns of m{x, •) have distinct poles. 

4. m{x,-) admits limits m^{x,) as z ^ Y\Z from the left and right half- 
planes. 

Theorem 7. [13, Thm. B] (a) Suppose A e Pq- For z € T, there is a unique 

matrix v{z) such that for every .2; G K 

m+{x,z) = e-^^-^i;(z)e^^-^m-(a;,z). (4.4) 

(b) For each pole Zj G Z, there is a matrix v{zj) such that the residue of m 
satisfies 

Resimix,-y,Zi) = lim e-''''^v{z)e'"'^mix, z). (4.5) 

(c) The generic potential A is uniquely determined by the jump matrix v{z), 
z and the residues Res(m(a;, ■):,Zj), zj G Z. 

4.2 Inverse scattering theory 

The jump matrix v{z), the poles Z = {zi, . . . , Zm} and the residues v{zj) con- 
stitute the scattering data. The reconstruction of A from the scattering data is 
the inverse spectral problem. Though Theorem 7 guarantees that the scattering 
data associated to a generic potential is unique, this assertion is proved via an 
application of Liouville's theorem and is not constructive. What is required is 
a constructive procedure to obtain A given the scattering data. 

In order to state the inverse spectral theorems, we work with potentials 
that lie in the Schwartz class <S(M, m). This assumption is not necessary, but it 
simplifies the exposition. Analogous finite regularity results can also be obtained 
as in [13]. 

Theorem 8. (a) Suppose A G 5(M,m). Then there is R> and C°° functions 
m'-^^ : M 0, A: = 0, 1, . . . such that 

00 

m(a;,0) = ^^"W=^(a;), x&W,\z\>R, (4.6) 
fe=o 

and the series converges uniformly in x and z. 
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(b) The coefficients m^'^^ may he determined recursively. In particular, m^^^ 
is a diagonal matrix with entries 




J — oo 



and the off-diagonal entries of m^^^ are given by 




(4.8) 



(c) The asymptotic expansion (4-6) may also be written 



oo 



m{x, z) = m' 



,(°)(a;)ft(a;,z), h{x,z) = ^z-^h^''\x), x G R, jz] > i?, (4.9) 



where h''^\x) = /, and h^^\ k>l, are in the Schwartz class iS(]R, g). 

Recall that here ui < U2 < ... < un are the diagonal entries of M.. Part 
(b) of the theorem allows us to uniquely reconstruct the potential. Assume 
given a global reduced wave function m with the asymptotic expansion (4.6). 
Then the off-diagonal terms of A arc given by (4.8), and the diagonal terms are 
given by the relation Ajj = — J2k=^j ^jk- It is necessary to assume that m is a 
reduced wave function: an arbitrary set of functions m^''\x) is not admissible. 
Indeed, the constraint Ajj = — '^k=ij -^jk implies many relations between the 
coefficients m'-'^-' . For example, we have 



The full inverse scattering problem relates the scattering data to the poten- 
tial A. In light of Theorem 8, it suffices to reconstruct m from the scattering 
data. Since m is holomorphic in C\(S U Z) it is expressed in terms of the 
scattering data by Cauchy integrals. The associated integral equations are in- 
dependent of our assumption that A e m, and the results of [13] relating m 
and the scattering data apply directly. The subtlety is that the scattering data 
satisfies both algebraic, analytic and topological constraints. For example, these 
may be constraints involving the zeros and winding numbers of principal minors 
of V [13, Thm D]. For generic potentials that satisfy these constraints, m and the 
scattering data are related by Cauchy integrals that preserve the Schwartz class. 
For such scattering data, the inverse scattering problem is solved by mapping 
the scattering data to m via Cauchy integrals, and then mio A via Theorem 8. 
In the simplest situation, Z is empty, and the wave function is reconstructed 
from the jump on S alone. 




(4.10) 
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4.3 Evolution of scattering data and the Cauchy problem 

Wc now combine the x and t dependence, and consider the Cauchy problem for 
the discrete zero-curvature equations (1.21) with initial data A{x,0) = Ao{x) G 
m. Let vo{z), z € S and vo{zj) denote the scattering data of Aq. The scattering 
data evolve by the simple linear equations 

[zAf,v{z)], 2GE (4.11) 
[zJ\f,v{zj)], ZjGZ. (4.12) 

v{zj,t) = e''^^voizj)e-''^^. (4.13) 

The evolution is formally stable in the terminology of [14, §3.12]). We may 
now combine Theorem A and Theorem C of [14] to obtain the following basic 
well-posedness theorem for (1.21). 

Theorem 9. Assume Aq is a generic potential in (S(IR,m) with associated scat- 
tering data vo{z), 2; e S and vo{zj), z € Z. Then there is T > and a unique 
smooth map [0,T) 5(R, m), t i->- A{-,t), such that the scattering data of 
A{-,t) is given by (4-13) and A{x,t) solves the Cauchy problem for the discrete 
zero- curvature equations (1-21) with initial data Af). 

In addition, the following dichotomy holds [14, Thm. B] . 

Theorem 10. Under the hypotheses of Theorem 9, suppose T G (0, 00] is max- 
imal. Then either T = 00 or limt^j^ || A(-, t)||^2(]g) = 00. 

In general, the maximal time interval is finite. However, global existence is 
guaranteed if Aq is triangular. If A is triangular, so are B and [A, B] , and [A, B] 
vanishes on the diagonal. Thus 

Tv{A'^[A,B]) =0. (4.14) 

In addition, for A e S{R, m) 

/ Tr {A'^{x)d^B) dx = J2 / ^ij^^^ij dx = 0. (4.15) 

Thus, the evolution of (1.21) is dissipative (see [14, §1.11]) and we have global 
existence. 

Corollary 1. Assume the hypotheses of Theorem 9 and assume in addition that 
Aq{x) is triangular for every x gR. Then T = 00. 

It is surprising that we do not need to assume that A is everywhere lower 
triangular or everywhere upper triangular. The assumption that A is triangular 



dv{z) 

dt 

with the unique solution 

v{z,t) = e''^va{z)e-''^, 



22 



pointwisc is enough to ensure (4.14) for every a; G R, which in turn imphes 
dissipativity. 

Finally, let us connect these results with the probabilistic context that moti- 
vated us. In order to ensure that A is truly a generator, we must ensure that the 
off-diagonal terms are positive. Since smooth solutions exist, this is preserved 
at least for a short time. However, it is more subtle to ensure global existence. 
Here the convexity of / plays an important role. 

Theorem 11. Assume the hypotheses of Theorem 9. In addition, assume that 
f is convex, and that Ao{x) is the generator of a spectrally negative Markov 
process. Then T = oo and A{x, t) remains the generator of a spectrally negative 
Markov process for every t> Q. 

Proof. Equation (1.21) may also be solved by the method of characteristics. 
Indeed, Bij = FijAij, thus each entry Aij evolves on a characteristic with 
speed —Fij. The characteristic speed is simply the Rankine-Hugoniot condition 
associated to the shock connecting states Ui and uj. We integrate (1.21) on 
characteristics to find 

Aij{x,t) = {Ao)ij{x + Fijt)+ [ [A,B]ij{x + Fijs,s)ds. (4.16) 

Since Ui < ... < um and / is convex, we now find exactly as in the proof 
of Theorem 3 that [A,B]ij > 0, i > j. The diagonal terms are conserved 
on characteristics since [A, Bju = since B = F o A. Similarly, the upper- 
triangular part of [A, B] vanishes if A and B are lower- triangular. A simple 
maximum principle argument shows that A remains lower-triangular. □ 

The integral equations (4.16) can also be used to give a direct proof of global 
existence of solutions without the assumption that Aq is generic. Since A^j > 
on the off-diagonal, and An is conserved, we have the bound ||^ij(-, f)||j;,oo(]K) < 
||(^o)m||l°=(r)) which ensures global existence. 

4.4 The ZS-AKNS hierarchy 

The discrete zero-curvature equations (1-21) are part of a hierarchy of commut- 
ing Hamiltonian flows. The existence of such hierarchies was established in the 
pioneering work of Zakharov and Shabat [49] and Ablowitz, Kaup, Newell and 
Segur [2]. We now derive the associated hierarchy for (1.21): as expected this 
is a modification of the hierarchy for the N-wa.ve model. Our calculations and 
notation follow [45]. 

We fix a diagonal matrix A/" and consider the asymptotic behavior oiQ{x, z) = 
m~^Afm as z — > 00. By Theorem 8(c), we have m~^Afm = h~^JVh, and the 
expansion (4.9) yields 

oo 

Q{x, z) = h-^Nh ~ Q('=)^-^ z^oo. (4.17) 

fe=0 
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We call Q^'^^ the fc-th flux. It admits an expansion 

= z''Af + z'^-^Bi + ^^-2^2 + ... + Bk, Bfe e m. (4.18) 
Definition 12. The fc-th flow in the hierarchy is given by the equation 

diA-d.^Q^''^ = [A,Q^% k>0. (4.19) 
The zero-curvature equation (4.19) may also be written in the Lax form 



d^ + zM + A,dt + Q 



(fc) 



= 0. 



(4.20) 



The fc-th flux is obtained as follows. We show below that Q satisfies the linear 
equation 

Q,^[Q,zM + A]. (4.21) 
The asymptotic expansion (4.17) now yields the hierarchy of linear equations 

= [Q^°\M], 



and 



Since h{x, z) 



- z^'^h'''^'' as z -H- cxD, we have 
Q{x,z)^M+ ^-^'^^'V ..., z^oo. 



(4.22) 
(4.23) 

(4.24) 



Thus, Q(°) = N is the solution to (4.22). For /c > 1 the ansatz (4.18) yields 
k + 2 linear equations for Bj. We use (4.18) and (4.20) to obtain the equations 



0{z^),k' 



0{z^+^) 
0{z'') 

1 > i > 1 

0(1) 



[M,J\f]=0, (4.25) 
d^Af+[A,J\f] + [M,Bi]=0, (4.26) 

d^Bk-j + [A, Bk-j] + [M,Bk+i-j] = 0, (4.27) 
d^Bk-dtA + [A,Bk]=0. (4.28) 



The 0(z''^^) equation is trivially satisfied since J\f is diagonal. Since JV is 
independent of a;, the 0{z'') equation generalizes equation (1.28). When k = 1 
this yields the off'-diagonal terms of Si = B in accordance with (1.20). For 
fc > 1 wc recursively solve (4.25) until wc obtain Bk- This is very similar to 
the recursion for the classical N-wave model with one important difference. 
When solving (4.25) recursively, we realize that the 0{z) term yields only the 
off-diagonal terms of Bk- In the earlier work of Zakharov and Manakov, the 
diagonal terms of B^ vanished because of the assumption that B^ G u. Here 
these terms suffice to determine Bk since B). G m. 

The fc-th flow may be solved by the inverse scattering method. The scattering 
data evolves by 



v{z,t) = e'''^vo{z)e 



v{zj,t) = e''''i-'^vo{zj)e 



(4.29) 
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Well-poscdncss of the fc-th flow requires that the evolution is formally stable [14, 
3.12]. In our case, this requires Re{z'^JVjj ) = for z G S and each diagonal entry 
of J\f. Since E is the imaginary axis, this is satisfied for all odd k when Af is 
real (in particular, for Af given by diag(/(7ii), . . . , /(um))- For even fc, we need 
to choose J\f purely imaginary, and we find that Bk is purely imaginary if A is 
real. This is incompatible with Asm. For odd k, Theorem 9 and Theorem 10 
hold with (4.13) replaced by (4.29) and (1.21) replaced by (4.19). It is not clear 
if these equations have a true probabilistic interpretation. 



4.5 Proofs of Theorem 6 and Theorem 7 

We now present the calculations and matrix factorization theorem that underlie 
Theorem 6 and Theorem 7. To this end, it is enough to assume that A is C°° 
with compact support in x. A density argument as in [13] yields the conclusions 
for A G L-^(M, m). To construct a globally bounded solution m{x,z) we assume 
that A has compact support and solve the initial value problem 

mx = z[m, M] + mA, rh{x, z) = I,x 0. (4.30) 

It is clear that (4.30) has a unique solution that is holomorphic in z. In addition, 
since nix = z[fa,A4], x ^ 0, there exists a holomorphic matrix s{z) such that 

rh{x, 0) = I ll.xMg^^^^z.M^ I J l[ (4-31) 
Observe also that m is always invertible because 

det(TO)(a;, z) = exp ( / Tr(A(s)) ds] . (4.32) 



We seek a boimded solution to (4.3) of the form m{x,z) = g{x, z)rh{x, z). 
Since m solves (4.3) and m solves (4.30) we find that g{x, z) = e~^^-^S((2;)e^^-'^ 
for some matrix g{z). The asymptotic behavior of m in (4.31) then implies 

( ^ _ / e-^^-^s((.2)e^^-^, a; < 0, 
m[,x,z)-<^ ^_,xMg^^^g(^^^^zxM^ ar>0. ^^"^'^^ 

It only remains to choose g so that m is globally bounded in x for fixed z. 

First assume z is in the left-half plane. Recall that M = diag(?ii, . . . ,Um) 
with Ml < 7i2 < • • • < u,!,. Thus, m,jk{x, z) = gjk{z)e^^^'^''^^^^ for x -C 0. Since 
Ile{z) > 0, nijk is bounded only if gjk — for ) < k. Thus, g is lower-triangular. 
We next find that m is bounded as a; — >■ oo only if g(z)s{z) is upper-triangular. 
Finally, since m — > / as .x — oo, we see that the diagonal entries of g are all 1. 
In order to choose g in accordance with these constraints, recall that Gaussian 
elimination may be written as the matrix factorization 

s{z) = L{z)D{z)U{z) (4.34) 
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where L and U arc lower and upper triangular matrices that are 1 on the 
diagonal and D{z) = diag(det(si(2:), . . . , Sm{z)) where Sk{z) denotes the k x k 
upper-block of s{z) [26, Thm 1.1]. By construction, L, D and U are unique 
except at the zeros of det(sfe(z)), k = l,...,m. Since s is entire, this set is 
discrete. We then choose g~ — {z) = L(z)~^ , the superscript denoting the left- 
half plane. By construction, g is meromorphic in the left-half plane. 

A similar calculation in the right-half plane Re(z) > reveals that g{z) 
must be upper-triangular and g{z)s{z) must be lower-triangular. In this case, 
we factorize 

s{z) = L{z)D{z)U{z) (4.35) 
where now D ~ diag(det(si(z), . . . , Sm{z)) and Sk{z) denotes the k x k lower- 
block of s{z). We now find g~^{z) = U{z) 

Let Z denotes the set of zeros of det(sfc(z)) and det(sfc(z)), k = 1,. . . ,m. 
The factorizations g^{z) are continuous on ^; e T,\Z. To summarize, we have 

m^{x, z) = e-^^-^5=^(^)e^^^m(a;, z), e K, ^ G C\Z (4.36) 

where m{x, z) is entire in z, g"^ are obtained by factorizing s as in (4.34) and 
(4.35). In order to obtain the scattering data, we isolate the jump in m on S. If 
we set ^p{x,z) = m+(m^)^^, a; G M, z e S, we find ipx = z[ip,M.\. Thus, there 
exists a matrix v{z) such that ip{x,z) = e~^^^w(2)e^^-^ and we have 

m+{x, z) = e-^^^w(2)e^^-^m-(a;, z), x&%z& Y\Z. (4.37) 

These are the main calculations needed to establish the existence of the jump 
measure, and it is clear that the assumption A G m (as opposed to Ajj = 0) has 
played only a minor role (e.g. (4.32) has replaced det(m) = 1.). The arguments 
in [13, pp. 48-49] are similarly modified to yield Theorem 6 and Theorem 7. 



4.6 Proof of Theorem 8 

Theorem 8 is a modification of [13, Thm 6.1]. The main differences are that 
mo is no longer the identity, and we have to solve separately for diagonal and 
off-diagonal terms. It is simplest to postulate an expansion of the form (4.6) 
and solve for the terms rrij . One may then justify the expansion for A that is 
suitably regular as in [13, §6]. 

Assume (4.6) holds and m{x, z) ^ I as x ^ — oo. We substitute this ansatz 
in (4.3) to find the hierarchy of equations 

0=[m^°\x),M], (4.38) 

and 

-mkA=[m^>'+^\x),M], fc = 0,l,... (4.39) 

ax 

Equation (4.39) implies that on the diagonal 




(4.40) 
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On the off-diagonal 



dx 



-(mW^). =K-«i)m!r'\ *^J- (4.41) 



Equation (4.38) implies that m^*^^ is a diagonal matrix. We then solve (4.40) 
with fc = to obtain (4.7). It is simplest to solve the rest of the hierarchy by 
making the ansatz 

oo 

m{x,z)=m^°^^z-^h^^\x,z)=m^°^h{x,z). (4.42) 

fe=0 

Let Ad and Ao denote the diagonal and off-diagonal terms of A. We substitute 
(4.42) in (4.3) to find 

K = [h, zM + Ad] + hAo. (4.43) 
Then we have the hierarchy of linear equations 

0=[h^°\M], (4.44) 

and 

4*=) - [h'^''\Ad\ - h^'^^Ao = [h^''+^\M\, k>0. (4.45) 

Equation (4.44) implies that h^^^ is diagonal. We then consider the diagonal 
terms of (4.45) with A; = to find = 0. Since lima;^_oo hi-'-- ^) = this 
implies /i^") = / as expected. The off-diagonal terms of (4.45) with A; = may 
be solved algebraically and yield 

■' Ui — Uj 

This process can be continued indefinitely. At each step, we first solve a 

difFcrcntial equation that yields the diagonal terms of h'^'^^ , and then an algebraic 
equation that yields the off-diagonal terms of h^'^'^^K For example, we find 

and for the off-diagonal terms of h^'^^ 

^(2) ^ 1 I AjkAkj ^(1)^. . _^ AjjAu - Ajj) _ d^Ajj \ 

Ui-Uj \~'.Ui-Uk " Ui-Uj Ui-Uj j' 

This process becomes increasingly unwieldy, but at every step h^^"> is expressed 
as a finite number of integro-differential terms oi A. If A is in the Schwartz 
class so is h^'^^ for > 1. This is enough to establish the uniform convergence 
of Theorem 8. 
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